POLYNOMIAL RETRACTS AND THE JACOBIAN CONJECTURE


VLADIMIR SHPILRAIN AND JIE-TAI YU 


Abstract. Let K[x,y] be the polynomial algebra in two variables over a field
K of characteristic 0. A subalgebra R of K[x,y] is called a retract if there is an
idempotent homomorphism (a retraction, or projection) φ : K[x,y] → K[x,y] such
that φ(K[x,y]) = R. The presence of other, equivalent, definitions of retracts
provides several different methods of studying them, and brings together ideas 
from combinatorial algebra, homological algebra, and algebraic geometry. In this 
paper, we characterize all the retracts of K[x,y] up to an automorphism, and 
give several applications of this characterization, in particular, to the well-known 
Jacobian conjecture. Notably, we prove that if a polynomial mapping φ of K[x,y]
has invertible Jacobian matrix and fixes a non-constant polynomial, then φ is an
automorphism. 


1. Introduction 

Let K[x, y] be the polynomial algebra in two variables over a field K of characteristic
0. A subalgebra R of K[x, y] is called a retract if it satisfies any of the following
equivalent conditions:

(R1) There is an idempotent homomorphism (a retraction, or projection)
φ : K[x,y] → K[x,y] such that φ(K[x,y]) = R.

(R2) There is a homomorphism φ : K[x, y] → R that fixes every element of
R.

(R3) K[x, y] = R ⊕ I for some ideal I of the algebra K[x, y].

(R4) K[x,y] is a projective extension of R in the category of K-algebras. In other
words, there is a split exact sequence 1 → I → K[x,y] → R → 1, where I is the
same ideal as in (R3) above.
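
To see how these conditions fit together, here is a short check that (R1) gives (R3). It is only an illustrative sketch: taking I to be the kernel of the retraction φ is our assumption about which ideal is meant, and we use that φ fixes every element r = φ(g) of R because φ is idempotent.

```latex
% Sketch: (R1) => (R3), assuming I := \ker\varphi.
\begin{align*}
f &= \varphi(f) + \bigl(f - \varphi(f)\bigr), \qquad \varphi(f) \in R, \\
\varphi\bigl(f - \varphi(f)\bigr) &= \varphi(f) - \varphi^{2}(f) = 0
  \quad\Longrightarrow\quad f - \varphi(f) \in \ker\varphi, \\
r \in R \cap \ker\varphi &\;\Longrightarrow\; r = \varphi(r) = 0 .
\end{align*}
% Hence K[x,y] = R \oplus \ker\varphi as a direct sum of K-vector spaces.
```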

Examples: K; K[x,y]; any subalgebra of the form K[p], where p ∈ K[x,y] is a
coordinate polynomial (i.e., K[p, q] = K[x, y] for some polynomial q ∈ K[x, y]). There
are other, less obvious, examples of retracts: if p = x + x^2 y, then K[p] is a retract
of K[x, y], but p is not coordinate since it has a fiber {p = 0} which is reducible, and
therefore is not isomorphic to a line.

The very presence of several equivalent definitions of retracts shows how natural
these objects are. Later on, we shall also comment on a very natural geometric 
meaning of retracts. 
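
The reducibility of the fiber {p = 0} for the example p = x + x^2 y above can be read off from a factorization. This is a routine check, included only for illustration.

```latex
% The zero fiber of p = x + x^2 y splits into a line and a hyperbola.
p = x + x^{2}y = x\,(1 + xy), \qquad
\{p = 0\} = \{x = 0\} \,\cup\, \{xy = -1\}.
```

Every fiber of a coordinate polynomial is isomorphic to a line, hence irreducible, so a fiber with two components rules out p being coordinate.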

In 0, Costa has proved that every proper retract of K[x,y] (i.e., one different
from K and K[x,y]) has the form K[p] for some polynomial p ∈ K[x,y], i.e., is
isomorphic to a polynomial K-algebra in one variable. A natural problem now is
to characterize somehow those polynomials p ∈ K[x, y] that generate a retract of
K[x,y]. Since the image of a retract under any automorphism of K[x,y] is again
a retract, it would be reasonable to characterize retracts up to an automorphism of
K[x,y], i.e., up to a “change of coordinates”. We give an answer to this problem by
proving the following

Theorem 1.1. Let K[p] be a retract of K[x,y]. There is an automorphism ψ of
K[x, y] that takes the polynomial p to x + y · q for some polynomial q = q(x, y). A
retraction for K[ψ(p)] is given then by x → x + y · q; y → 0.
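
As a sanity check on the last sentence of the theorem, one can verify directly that the stated mapping is a retraction onto the subalgebra generated by x + y · q. The sketch below writes φ for that mapping and p for x + y · q; both names are used here only for the purpose of the check.

```latex
% \varphi : x \mapsto x + y\,q,\quad y \mapsto 0; write p = x + y\,q(x,y).
\begin{align*}
\varphi(p) &= \varphi(x) + \varphi(y)\,\varphi(q) = (x + yq) + 0\cdot\varphi(q) = p, \\
\varphi(K[x,y]) &= K[\varphi(x),\,\varphi(y)] = K[p].
\end{align*}
% Example: q = x^2 gives p = x + x^2 y, and \varphi(p) = p + p^2 \cdot 0 = p.
```

So φ maps K[x,y] onto K[p] and fixes every element of K[p], which is exactly condition (R2).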

Our proof of this result is based on the well-known Abhyankar-Moh theorem [|l|. 
We note in passing that Theorem 1.1 also yields a characterization of retracts of a 
free associative algebra K⟨x,y⟩ (see Theorem 2.1 in the next Section 2) if one uses a
natural lifting. In Section 2, we also make an observation on retracts of a polynomial 
algebra in arbitrarily many variables (Proposition 2.2). 

Theorem 1.1 yields another useful characterization of retracts of K[x,y]: 

Corollary 1.2. A polynomial p ∈ K[x,y] generates a retract of K[x,y] if and only
if there is a polynomial mapping of K[x,y] that takes p to x. The “if” part is
actually valid for a polynomial algebra in arbitrarily many variables.

Although the form to which any retract can be reduced by Theorem 1.1 might seem 
rather general, it is in fact quite restrictive, and has several interesting applications, 
in particular, to the notorious 

Jacobian conjecture. [|Tl|] If for a pair of polynomials p, q ∈ K[x, y], the corresponding
Jacobian matrix is invertible, then K[p,q] = K[x,y].

For a survey and background on this problem, the reader is referred to . 

To explain how retracts of K[x,y] and the Jacobian conjecture are connected, we 
need one more 

Definition. A polynomial p ∈ K[x,y] has a unimodular gradient if there are u, v ∈
K[x, y] such that (∂p/∂x) · u + (∂p/∂y) · v = 1.

If the ground field K is algebraically closed, then this is equivalent, by Hilbert’s 
Nullstellensatz, to the gradient being nowhere-vanishing. 
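
For a concrete instance of this definition, take the polynomial p = x + x^2 y from the examples above. The particular choice of u and v below is ours, found by inspection, and is not taken from the text.

```latex
% Unimodular gradient of p = x + x^2 y, with an explicit (assumed) choice of u, v.
\begin{align*}
\frac{\partial p}{\partial x} &= 1 + 2xy, \qquad \frac{\partial p}{\partial y} = x^{2}, \\
u &= 1 - 2xy, \qquad v = 4y^{2}, \\
\frac{\partial p}{\partial x}\,u + \frac{\partial p}{\partial y}\,v
  &= (1 - 4x^{2}y^{2}) + 4x^{2}y^{2} = 1 .
\end{align*}
```

This agrees with the geometric criterion: x^2 = 0 forces x = 0, where ∂p/∂x = 1, so the gradient vanishes nowhere.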

Now we are ready to establish a link between retracts of K[x,y] and the Jacobian
conjecture by means of the following

Conjecture “R”. If a polynomial p ∈ K[x,y] has a unimodular gradient, then K[p]
is a retract of K[x,y].

Theorem 1.3. Conjecture “R” implies the Jacobian conjecture.

In fact, Conjecture “R” is stronger than the Jacobian conjecture since there are
polynomials that have a unimodular gradient, but do not have a Jacobian mate. The
simplest example is p = x + x^2 y: this polynomial has a unimodular gradient, but
this gradient cannot be completed to an invertible 2 × 2 matrix by the gradient of
any other polynomial.

Therefore, a statement equivalent to the Jacobian conjecture would be: if (p, q) is 
a Jacobian pair (i.e., the corresponding Jacobian matrix is invertible), then K[p] is a
retract of K[x,y]. This statement is formally weaker than the Jacobian conjecture, 
but is actually equivalent to it by our Theorem 1.3. 

There are several different ways of proving Theorem 1.3 using the characterization 
of retracts given in Theorem 1.1; we believe that the proof we give here (using Newton 
polygons) is particularly elegant. 

As a corollary, we show that the Jacobian conjecture is equivalent to other (formally)
much weaker statements:

Corollary 1.4. Either of the following claims is equivalent to the Jacobian conjecture.
Suppose (p, q) is a Jacobian pair. Then:

(i) for some coordinate polynomial g, K[x,y] = K[p] + ⟨g⟩, where ⟨g⟩ is the
ideal of K[x,y] generated by the polynomial g.

(ii) for some coordinate polynomial g, K[x,y] = K[g] + ⟨p⟩.

Corollary 3.1 in Section 3 also seems interesting since it improves several known 
partial results on the two-variable Jacobian conjecture.


Conjecture “R” has the following geometric interpretation. Suppose we have a 
polynomial p ∈ C[x,y] with nowhere-vanishing gradient. Then every fiber {p =
c}, c ∈ C, is a non-singular curve on the complex plane. Fix a particular fiber,
say, {p = c_0}. If there is another polynomial q ∈ C[x,y] such that every fiber of
q intersects the curve {p = c_0} transversally (i.e., without common tangent line) at
exactly one point, then we can arrange a geometric projection of the plane onto the
curve {p = c_0} by sliding a point on a fiber {q = c} toward the intersection point
of {q = c} with {p = c_0}. This geometric projection will also be algebraic, i.e., C[p]
will be a retract of C[x,y] in this case.


As far as the Jacobian conjecture is concerned, the Jacobian condition ensures 
transversal intersection of any fiber of p with any fiber of q if (p, q) is a Jacobian pair. 
Moreover, by a result of Kaliman , we can restrict our attention to the situation 
where all the fibers of p and q are irreducible, i.e., are connected curves. The only 
problem is to show that there is a fiber of p which intersects every fiber of q at exactly
one point, i.e., that for some c_0 ∈ C, the system of equations p(x, y) = c_0; q(x, y) = c
has a unique solution for any c ∈ C. Thus, we have:

Corollary 1.5. (cf. [§). Suppose φ is a polynomial mapping of C[x,y] with
invertible Jacobian matrix. If φ is injective on some line, then φ is an automorphism.


We also note here that the aforementioned result of Kaliman calls for an example
of a non-coordinate polynomial p ∈ C[x, y] that has a non-vanishing gradient
and all of whose fibers are irreducible. If there is no polynomial like that, then the 
two-variable Jacobian conjecture is true. 

Another application of retracts to the Jacobian conjecture (somewhat indirect
though) is based on the “φ^∞-trick” familiar in combinatorial group theory (see
||1^ ). For a polynomial mapping φ : K[x,y] → K[x,y], denote by φ^∞(K[x,y]) =
∩_{k=1}^∞ φ^k(K[x,y]) the stable image of φ. Then we have:
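
To make the notion of stable image concrete, here is a small computation for a mapping chosen by us purely for illustration; it is not an example from the text, and its Jacobian determinant 2x is not invertible.

```latex
% Assumed example: \varphi : x \mapsto x^2,\ y \mapsto y.
\begin{align*}
\varphi^{k}(K[x,y]) &= K\bigl[x^{2^{k}},\, y\bigr], \\
\varphi^{\infty}(K[x,y]) &= \bigcap_{k \ge 1} K\bigl[x^{2^{k}},\, y\bigr] = K[y].
\end{align*}
% A polynomial lying in every K[x^{2^k}, y] can only involve x to the power 0.
```

In this example the stable image K[y] is itself a retract of K[x,y], via x → 0, y → y.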

Theorem 1.6. Let φ be a polynomial mapping of K[x,y]. If the Jacobian matrix
of φ is invertible, then either φ is an automorphism, or φ^∞(K[x,y]) = K.

Our proof of Theorem 1.6 is based on recent results of Formanek and
Connell-Zweibel §.

Obviously, if φ fixes a polynomial p ∈ K[x,y], then p ∈ φ^∞(K[x,y]). Therefore,
we have: 

Corollary 1.7. Suppose φ is a polynomial mapping of K[x,y] with invertible Jacobian
matrix. If φ(p) = p for some non-constant polynomial p ∈ K[x, y], then φ is an
automorphism.

This yields the following re-formulation of the Jacobian conjecture: if φ is a polynomial
mapping of K[x, y] with invertible Jacobian matrix, then for some automorphism
α, the mapping α · φ fixes a non-constant polynomial.


2. Retracts of K[x, y] 

We start with 

Proof of Theorem 1.1. Let K[p], p G K[x,y], be a retract of K[x,y] (note that 
by a result of Costa 0, every retract of K[x,y] has this form). Without loss of 
generality, we may assume that p has zero constant term. 

Let the corresponding retraction be given by φ : x → q_1(p); y → q_2(p) for some
one-variable polynomials q_1, q_2. Again, we may assume that both q_1 and q_2 have
zero constant term.

Then, since φ is a retraction, polynomials q_1(p), q_2(p) should generate K[p].
Suppose q_1 and q_2 have degree n ≥ 1 and m ≥ 1, respectively. Then, by the
Abhyankar-Moh theorem [^, either n divides m, or m divides n. Suppose deg(q_1) =
k · deg(q_2) for some integer k ≥ 1.

Then we make the following change of coordinates: x → x̃ = x − c · y^k; y → ỹ = y,
where the coefficient c ∈ K* is chosen so that in the polynomial q_1 − c · q_2^k, the leading
terms would cancel out.

In these new coordinates, our retraction acts as follows: φ : x̃ → q_1 − c · q_2^k = q̃_1;
ỹ → q_2 = q̃_2. Polynomials q̃_1 and q̃_2 are easily seen to be another generating set of
K[p], but the sum of degrees of q̃_1 and q̃_2 is less than that of q_1 and q_2.

Continuing this process, we shall eventually arrive at a pair of polynomials one of 
which is zero. Denote the other one by h; then we must have K[h] = K[p], i.e., 
h = c · p for some c ∈ K*.

Thus, we have shown that for some automorphism ψ ∈ Aut(K[x,y]) (“change of
coordinates”), the composition φψ takes x to p, and y to 0. It follows that in
these coordinates, p(x, y) = x + y · q(x, y) for some polynomial q. This completes
the proof. □
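
The degree-reduction step in the proof above can be followed on a toy case. The retraction below is our own choice, made only to illustrate one step of the process; the tilde notation for the new coordinates is likewise ours.

```latex
% Assumed toy example: p = y, with retraction \varphi : x \mapsto y^2,\ y \mapsto y,
% so q_1(t) = t^2, q_2(t) = t, \deg q_1 = 2 \cdot \deg q_2, i.e. k = 2.
\begin{align*}
\tilde{x} &= x - c\,y^{2}, \qquad \tilde{y} = y, \qquad c = 1, \\
\varphi(\tilde{x}) &= q_1(p) - c\,q_2(p)^{2} = y^{2} - y^{2} = 0, \\
\varphi(\tilde{y}) &= q_2(p) = y = p .
\end{align*}
```

Here a single step already produces a pair with one zero entry, and the surviving polynomial is p itself.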

Proof of Corollary 1.2. (1) Suppose p ∈ K[x,y] generates a retract of K[x,y].
Then, by Theorem 1.1, for some automorphism ψ ∈ Aut(K[x,y]), the polynomial
ψ(p) has the form x + y · q(x, y). Let φ be a mapping of K[x, y] that takes x to x;
y to 0. Then φ(ψ(p)) = x.

(2) We are going to prove the “if” part for a polynomial algebra K[x_1, ..., x_n] in
arbitrarily many variables.

Let φ(p) = x_1. Consider the following mapping of K[x_1, ..., x_n]: ψ : x_1 → p; x_i → 0,
i = 2, ..., n. Then

(1) ψ(φ(p)) = p.

Denote ρ = ψφ. Then, by (1), ρ(p) = p, which means ρ fixes every element of
K[p]. Also, it is clear that ρ(K[x_1, ..., x_n]) = K[p]. Therefore, ρ is a retraction of
K[x_1, ..., x_n], and K[p] a retract. □
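
The construction in part (2) can be traced on a three-variable instance. The polynomial p and the mapping φ below are our own illustrative choices, and ρ simply denotes the composite "first φ, then ψ".

```latex
% Assumed example in K[x_1, x_2, x_3]: p = x_1 + x_2 x_3.
\begin{align*}
\varphi &: x_1 \mapsto x_1,\ x_2 \mapsto 0,\ x_3 \mapsto 0,
  & \varphi(p) &= x_1, \\
\psi &: x_1 \mapsto p,\ x_2 \mapsto 0,\ x_3 \mapsto 0,
  & \psi(\varphi(p)) &= \psi(x_1) = p, \\
\rho = \psi\circ\varphi &: x_1 \mapsto p,\ x_2 \mapsto 0,\ x_3 \mapsto 0,
  & \rho(p) &= p + 0\cdot 0 = p .
\end{align*}
```

The image of ρ is generated by ρ(x_1) = p, so ρ is a retraction onto K[p].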

Now we give a characterization of retracts of a free associative algebra K⟨x,y⟩:

Theorem 2.1. Let R be a proper retract of K⟨x,y⟩. There is an automorphism ψ
of K⟨x,y⟩ that takes R to K⟨v⟩ = K[v] for some element v of the form x + y ·
q(x, y) + w(x, y), where w(x, y) belongs to the commutator ideal of K⟨x, y⟩.

Proof. First of all, every element of the given form generates a retract of K⟨x,y⟩.
Indeed, the corresponding retraction is given by x → v; y → 0 (this mapping will
take w to 0 as well).

Now we are going to show that every retract of K⟨x, y⟩ has the form K⟨v⟩ = K[v]
for some element v ∈ K⟨x,y⟩. From the definition (R1) of a retract, we see that
every retract of K⟨x,y⟩ can be generated by two elements (φ(x) and φ(y)).

Another easy observation is that if R is a retract of K⟨x, y⟩ with the corresponding
retraction φ, then R^α is a retract of K[x,y], and the corresponding retraction is
φ^α. Here R^α denotes the image of R under the natural abelianization mapping
α : K⟨x,y⟩ → K[x,y] (note that the kernel of this mapping is the commutator
ideal of K⟨x, y⟩), and φ^α is the natural abelianization of φ. The best way to see it is
to apply the definition (R2) of a retract.

Upon combining these two observations with what we know about retracts of
K[x,y], we see that generators of our retract R must be of the form v_1(x,y) =
q_1(x, y) + w_1(x, y) and v_2(x, y) = q_2(x, y) + w_2(x, y), where w_1, w_2 belong to the
commutator ideal of K⟨x,y⟩, and q_1^α = h_1(p), q_2^α = h_2(p), where p = p(x,y) ∈ K[x,y] is
a polynomial that generates a retract of K[x,y].

Moreover, since R^α should be equal to K[p], we should have K[h_1(p), h_2(p)] =
K[p], so that we are in a position to apply the Abhyankar-Moh theorem. Repeating the
argument from the proof of Theorem 1.1, we see that after applying an appropriate
automorphism ψ ∈ Aut(K[x,y]) (“change of coordinates”), our pair (h_1(p), h_2(p))
becomes (ψ(h_1(p)), 0).

Since every automorphism of K[x,y] can be lifted to an automorphism of K⟨x,y⟩
(see [Q, Theorem 8.5), we have shown that, after applying an automorphism if necessary,
our retract R can be generated by two elements, one of which (call it u)
belongs to the commutator ideal of K⟨x,y⟩, and the other one (call it v) is of the
form x + y · q(x,y) + w(x,y), where w(x,y) belongs to the commutator ideal of
K⟨x, y⟩, and the corresponding retraction is φ : x → v, y → u.

Then, since φ^α annihilates y^α (see above), we should have u = φ(y) belong to the
commutator ideal of K⟨x,y⟩.

On the other hand, φ should fix every element of R, in particular, the element
u. Suppose u ≠ 0. Write u in the form u = m_1 + ū, where m_1 is the sum
of lowest degree terms, i.e., any monomial of ū has degree greater than that of
monomials in m_1. Then, since u belongs to the commutator ideal of K⟨x,y⟩,
every monomial in m_1 depends on y. Therefore, the image of any monomial in m_1
under the endomorphism φ is a sum of monomials whose degree is greater than that
of monomials in m_1 (this sum might be equal to zero, but it does not affect the
argument). The same applies to monomials of ū.

Thus, there is no way we can have φ(u) = u; this contradiction shows that u = 0,
which completes the proof of the theorem. □

We conclude this section with an observation on retracts of a polynomial algebra
in arbitrarily many variables.

Proposition 2.2. Let R be a proper retract of K[x_1, ..., x_n] generated by polynomials
p_1, ..., p_n, n ≥ 2. Then p_1, ..., p_n are algebraically dependent.

Proof. Let φ : K[x_1, ..., x_n] → R be a retraction, so that φ(R) = R. In particular,
φ restricted to R is an automorphism of R. If polynomials p_1, ..., p_n were algebraically
independent, then a result of would imply that φ is an automorphism
of K[x_1, ..., x_n] as well. In that case, we would have R = K[x_1, ..., x_n], which means
R is not a proper retract, hence a contradiction. □

We note that it is an open problem whether or not any retract of K[x_1, ..., x_n],
n ≥ 3, can be generated by algebraically independent polynomials. If this is the case,
it would imply the positive solution of the well-known cancellation problem - see 0 
for discussion. 


3. The Jacobian conjecture 


First we prove that our Conjecture “R” implies the Jacobian conjecture. 


Proof of Theorem 1.3. Let φ : x → p(x, y); y → q(x, y) be a polynomial mapping
of K[x, y] with invertible Jacobian matrix. Then, in particular, the polynomial p(x, y)
has a unimodular gradient.


If we assume that our Conjecture “R” is true, then p(x, y) generates a retract of
K[x, y], hence by Theorem 1.1, for some automorphism ψ ∈ Aut(K[x, y]), polynomial
ψ(p) has the form x + y · h(x, y). Therefore, upon combining φ with ψ if necessary,
we may assume that p(x, y) itself has this form.


Now we appeal to a result of concerning Newton polygons of a Jacobian pair
(p(x,y), q(x,y)). The Newton polygon of a polynomial f = f(x,y) = Σ a_{ij} x^i y^j is
the convex hull of {(i,j) | a_{ij} ≠ 0} ∪ {(0,0)}.
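
As an illustration of this definition, here is the Newton polygon of the polynomial p = x + x^2 y used as an example in the Introduction. The symbols supp(p) and N(p) are our shorthand for the set of exponent pairs and for the polygon, not notation from the text.

```latex
% Assumed notation: supp(p) = exponents with non-zero coefficient, N(p) = Newton polygon.
\operatorname{supp}(p) = \{(1,0),\ (2,1)\}, \qquad
N(p) = \operatorname{conv}\{(0,0),\ (1,0),\ (2,1)\}.
```

This triangle has an edge of length 1 on the x-axis, between (0,0) and (1,0). The same holds for any p = x + y · h(x, y) with h ≠ 0, since every exponent pair other than (1,0) then has positive second coordinate.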

The result of |j^ we need is that if (p(x,y), q(x,y)) is a Jacobian pair, but φ :
x → p(x,y); y → q(x,y) is not an automorphism, then Newton polygons of p(x,y)
and q(x, y) are radially similar.


Now, by way of contradiction, suppose φ is not an automorphism. Look at the
Newton polygon of p(x,y). We see that it has an edge of length 1, namely, the
one between the vertices (0,0) and (1,0). It follows that the similarity ratio for
Newton polygons of p(x, y) and q(x, y) is an integer; in particular, q(x, y) has the
form c_k x^k + y · f(x, y) for some c_k ∈ K, c_k ≠ 0.

Then replace the pair (p, q) with (p, q − c_k p^k). This new pair clearly has the same
properties as (p, q) does: it is a Jacobian pair, but the corresponding mapping is not
an automorphism. However, the highest degree of monomials of the form x^m in the
second polynomial has been decreased.

Therefore, we can repeatedly apply our argument (note that p(x, y) does not
change), until we get a pair (p, q), where q has no monomials of the form x^m.
But in that case, the Newton polygon of q has no edges along the x-axis, hence
Newton polygons of p and q cannot be radially similar. This contradiction completes
the proof of Theorem 1.3. □
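
The replacement step used in the proof above can be spelled out as follows. This is a sketch under the proof's own normalization, p = x + y · h and q = c_k x^k + y · f; we write J(·,·) for the Jacobian determinant, which is our notation.

```latex
% Why passing from (p, q) to (p, q - c_k p^k) removes x^k and keeps the Jacobian.
\begin{align*}
p^{k} &= (x + yh)^{k}
       = x^{k} + y \sum_{j=1}^{k} \binom{k}{j}\, x^{k-j}\, y^{j-1}\, h^{j}, \\
q - c_k\, p^{k} &= y \Bigl( f - c_k \sum_{j=1}^{k} \binom{k}{j}\, x^{k-j}\, y^{j-1}\, h^{j} \Bigr), \\
J\bigl(p,\; q - c_k\, p^{k}\bigr) &= J(p,q) - c_k\, k\, p^{k-1} J(p,p) = J(p,q).
\end{align*}
```

Moreover K[p, q − c_k p^k] = K[p, q], so the new pair defines an automorphism exactly when the old one does.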

Proof of Corollary 1.4. (i) Without loss of generality, we may assume that both 
p and g have zero constant term. Moreover, upon applying an automorphism to all 
polynomials under consideration if necessary, we may assume g = x. 

First we show that if K[x,y] = K[p] + ⟨x⟩, then the sum is actually direct.

By way of contradiction, suppose we have

(2) x · u = Σ_{i=1}^{m} c_i · p^i

for some non-zero polynomial u = u(x, y) and constants c_i ∈ K. Since the left-hand
side of (2) is divisible by x, the right-hand side should be divisible by x, too. This 
is only possible if p itself is divisible by x, but in that case, the Newton polygon of 
p would not have an edge along the y-axis, which contradicts p having a Jacobian 
mate (see e.g. [|1^). 

Thus, we have shown that K[x,y] = K[p] ⊕ ⟨x⟩. By the definition (R3) of a
retract, this implies K[p] is a retract of K[x,y]. Therefore, by our Theorem 1.3,
K[p,q] = K[x,y]. □

(ii) Arguing as in (i), we get p · u = Σ_{i=1}^{m} c_i · x^i. In this case, p cannot depend on
y, so p = p(x); but then the equality K[x, y] = K[x] + ⟨p⟩ is not possible.

Thus, K[x,y] = K[x] ⊕ ⟨p⟩. This clearly implies p = c · y for some c ∈ K*. The
result follows. □


Corollary 3.1. Let K be an algebraically closed field, and let (p, q), p, q ∈ K[x, y],
be a Jacobian pair. Suppose p has the form p = x + g for some g ∈ K[x, y], which
is divisible by a homogeneous polynomial. Then K[p, q] = K[x, y].


Proof. If p is linear, then we are done. Suppose p is non-linear, and suppose g is 
divisible by a non-constant homogeneous polynomial h. 

If h is divisible by x, then p itself is divisible by x. In that case, the Newton
polygon of p does not have an edge along the y-axis, which contradicts p having a
Jacobian mate (see 0).

If h is divisible by y, then K[p] is a retract of K[x, y]. If not, then there is c ∈ K*
such that the homomorphism x → p; y → cp takes h to 0. This is obviously a
retraction, so again, K[p] is a retract of K[x,y].
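
To see why such a constant c exists and why the resulting homomorphism is a retraction, consider a sample homogeneous factor. The choices K = C and h = x^2 + y^2 are ours, made only for illustration; g_1 denotes the cofactor in g = h · g_1, and we do not claim that a p of this shape has a Jacobian mate.

```latex
% Assumed example: K = \mathbb{C},\ h = x^2 + y^2,\ p = x + h\,g_1.
% For homogeneous h of degree d: h(p, cp) = p^d\, h(1, c); here h(1,c) = 1 + c^2, so c = i.
\begin{align*}
\varphi &: x \mapsto p,\quad y \mapsto i\,p, \\
\varphi(h) &= p^{2} + (i\,p)^{2} = 0, \\
\varphi(p) &= \varphi(x) + \varphi(h)\,\varphi(g_1) = p + 0 = p .
\end{align*}
```

The image of φ is generated by p and i·p, hence equals K[p], and φ fixes p. Note that c ≠ 0 precisely because h(1, 0) ≠ 0, i.e., because h is not divisible by y.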

Applying our Theorem 1.3 yields the result. □

Van den Essen and Tutaj have shown that if a Jacobian pair (p, q) is of the
form (x + h_1, y + h_2), where both h_1 and h_2 are homogeneous polynomials, then
K[p, q] = K[x, y]. It is notable that our Corollary 3.1 not only relaxes the condition
on the form of polynomials, but, most importantly, our condition is imposed on one
polynomial only.


To prove Theorem 1.6, we need the following result of independent interest:

Lemma 3.2. Let φ : x → p(x,y); y → q(x,y) be a polynomial mapping of K[x,y]
with invertible Jacobian matrix. Suppose φ(K[x,y]) contains a coordinate polynomial.
Then φ is an automorphism.


Proof. Upon composing φ with an automorphism if necessary, we may assume
that x ∈ φ(K[x,y]). This implies K[p,q,x] = K[p,q], and, therefore, K(p,q,x) =
K(p, q), where K(p, q) is the quotient field of K[p, q].

On the other hand, by a result of [0, (p, q) being a Jacobian pair implies K(p, q, x) =
K(x,y).

Therefore, we have K(p,q) = K(x,y), which by Keller’s theorem [|^ implies
K[p,q] = K[x,y]. □

Proof of Theorem 1.6. Suppose φ is not an automorphism. Then, by Lemma
3.2, φ(K[x,y]) contains no coordinate polynomials. In particular, the degree of any
polynomial in φ(K[x, y]) is at least 2. By using an inductive argument, we are going to
show now that the degree of any polynomial in φ^k(K[x,y]) is at least (k + 1); this
will imply φ^∞(K[x,y]) = K.

Consider an algebra φ^k(K[x,y]) for some k ≥ 1. Since φ is injective (this is
ensured by the Jacobian condition), polynomials φ^k(x) and φ^k(y) are algebraically
independent.

If φ restricted to φ^k(K[x,y]) were an automorphism of φ^k(K[x,y]), then, by a
result of 0, φ would be an automorphism of K[x,y], contrary to our assumption.
(Note that φ^k(K[x,y]) is invariant under φ since, by induction, φ^k(K[x,y]) ⊆
φ^{k−1}(K[x,y]) implies φ^{k+1}(K[x,y]) ⊆ φ^k(K[x,y]).) Hence, φ restricted to
φ^k(K[x,y]) is not an automorphism of φ^k(K[x, y]), and our previous argument yields
that φ^{k+1}(K[x, y]) contains no coordinate polynomials of φ^k(K[x,y]). In particular,
the degree of any polynomial in φ^{k+1}(K[x,y]) is greater than that of any coordinate
polynomial in φ^k(K[x,y]). This completes the induction. □


A. van den Essen has pointed out to us that in the case K = C, a more complicated
(geometric) proof of the result of Theorem 1.6 was given by Kraft 0 , who
also proved that if a polynomial mapping φ of C[x, y] is not birational (i.e., φ does
not induce an automorphism of the quotient field C(x,y)), then the stable image of
φ is a retract of C[x,y]. This yields a natural question - is the same true for any
polynomial mapping? (cf. HIT